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Introduction 

This  report  discusses  problems  resolved  in  the  processing  (including  capture)  of  the  1993 
National  Census  Test  (NCT)  survey.   It  may  help  in  the  planning  and  development  of  any 
future  Census  Test,  assuming  the  methodology  of  the  Census  to  continiue  as  a  self- 
completion,  mail-back  paper-and-pencil  form.  On  the  other  hand,  the  NCT  was  adapted  to 
1993  Labour-Force  Survey  (LFS)  procedures;  as  the  LFS  moves  to  CAI  (Computer-Assisted 
Interviewing)  some  of  our  problems  may  be  unique  to  the  1993  test. 

Field  Operations  -  Background 

The  NCT  used  a  November  8,  1993  reference  date.  Questionnaire  drop-off  started  on 
Saturday  October  30  and  fmished  on  Friday,  November  5,  1993. 

One  component  of  thie  test  was  selection  of  a  subsample  of  NCT  households'  whose 
responses  were  captured  as  the  "Edit  Failure  Survey"  (EFS)  component  of  the  survey  as  soon 
as  received  in  the  ROs,  then  sent  back  to  interviewers  for  field  edits  and  follow-up.   These 
questionnaires  were  re-captured  in  order  to  test  the  field  edits  and  follow-up. 

Startup  of  data  capture  of  EFS  responses  was  delayed  a  day  or  two  because  of  the  November 
11th  holiday  (Thursday)  and  problems  encountered  with  the  capture  program  written  in  the 
DC2  software.   The  first  transmission  was  received  from  the  Edmonton  RO  on  Saturday, 
November  13,  1993.   Data  collection  was  completed  for  EFS  responses  by  November  30, 
1993.   The  raw  EFS  file  consisted  of  10,145  person-records. 

Data  capture  of  NCT  responses  started  on  December  10,1993  and  the  final  transmission  was 
received  on  January  26,  1994.   The  raw  NCT  file  (combined  LFS-based  and  special 
populations)  consisted  of  47,057  person-records. 


Print  Requirements 

Labels:  Labels  and  interviewer-assignment  control-lists  were  generated  through  the  LFS  and 
had  to  fit  in  with  the  LFS  production  facilities.   The  household  identification  code  on  labels 


'Sample  selection  was  completed  on  the  Mainframe  by  Mike  Egan.   The  main  sample 
was  selected  as  a  subset  of  the  LFS  sample  from  April,  May  and  June  of  1991,  which 
"rotated  out"  of  (i.e.,  finished  with)  the  LFS  in  September  -  November  1991.   It  was  judged 
that,  two  years  after  LFS  participation,  the  households  would  not  have  retained  any  bias  in 
their  attitudes  from  the  LFS  experience  that  would  significantly  affect  their  NCT  responses. 
A  total  of  17,109  dwellings  was  selected.   The  EFS  sample  was  1/2  of  the  NCT  sample,  i.e., 
8500  dwellings.   A  further  3985  dwellings  were  picked  for  "special  population"  samples, 
primarily  from  1991  Census  files. 


and  control  lists  was  15  bytes  long,  consisting  of  the  LFS-file  fields  PSU,  GROUP, 
CLUSTER,  ROTATION,  LISTING  and  MULTIPLE.    Only  fourteen  bytes  of  identifying 
information  were  generated  in  the  label  program  instead  of  fifteen,  with  'multiple'  not  being 
printed.   To  resolve  this  problem  regional  offices  were  instructed  to  code  '0'  in  'multiple'  on 
the  labels  and  control  lists.   In  spite  of  the  instructions,  'multiple'  was  still  blank  for  many 
cases  on  the  assignment  control  lists.   This  was  problematic  because  they  were  to  be  linked 
to  household-related  responses  from  the  main  questionnaire  to  create  a  Household  file.   To 
recover  the  multiple  a  match  to  the  sample  file  to  pick  up  this  field  was  performed.   All 
multiple  codes  of  1  or  higher  were  selected  and  assigned  manually  on-line. 

All  linkage  specifications  used  a  20-byte  id  consisting  of  the  Interviewer  Assignment  Number 
(IAN),  PSU,  group,  cluster,  rotation,  listing  and  multiple.   With  hindsight,  it  would  have 
been  better  to  have  used  a  shorter  household-id  code  similar  to  the  LPS  RO-DOCKET. 
Linking  on  a  household  identifier  of  twenty  bytes  was  long,  cumbersome  and  error-prone. 

During  assignment  planning,  ROs  were  asked  to  assign  "7"  as  the  second  digit  of  the  IAN  if 
the  dwelling  was  picked  for  the  LFS-based  sample,  and  "9"  for  special-population  samples^ 
In  general,  this  worked  and  made  it  readily  possible  to  separate  and  order  the  assignments 
for  printing  labels  and  assignment  control  sheets.   However,  there  were  a  few  errors  which 
suggests  that  clean-ups  must  be  done  early  in  the  initial  stages  of  processing  in  order  to 
verify  the  IAN. 

Furthermore,  to  accomodate  the  IAN  planning  (R21  files)  there  needs  to  be  adequate  lead 
time  to  allocate  lANs.  These  are  allocated  by  PSU,  GROUP  and  CLUSTER  for  the  LPS 
rotate-out  sample.   We  allowed  three  months  for  the  completion  of  assignment  planning  in 
the  regional  offices. 

A  problem  arose  in  the  Special  Population  sample.   Some  EAs  exceeded  75  households 
which  was  the  maximum  allocation  for  one  assignment.   In  order  for  the  ROs  to  determine 
how  to  break  the  EA  down  into  meaningful  geographical  delineations  at  the  household  level 
the  household  id  was  carried  on  the  LOl  file.   A  hardcopy  listing  with  address  information 
was  provided  for  each  RO  to  cross  reference  with  the  household  id  given  on  the  LOl  file. 
For  EAs  that  were  too  large  for  one  assignment  '01'  was  put  in  the  yield'.   Fortunately,  none 
of  these  procedures  had  an  impact  on  the  instruction  manuals  written  by  Survey  Operations. 


^A  total  sample  of  3985  households  was  selected  using  1991  Census  Visitation  Records  to 
locate  EAs  with  high  concentrations  of  the  special  populations  of  interest.   The  Winnipeg  and 
Saskatoon  Metis  samples  were  obtained  from  provincial  membership  association  lists.   These 
households  were  in  addition  to  the  LFS-based  main  NCT  sample. 

'Yield  is  a  required  field  according  to  Labour  Force  procedures  for  updating  assignments 
in  the  ROs 


The  Special  Population  samples  were  not  weighted  because  they  are  not  representative 
samples  of  the  population  in  general,  and  national  estimates  will  not  be  calculated  from  the 
information  obtained.   The  major  purpose  of  these  samples  is  to  evaluate  the  Census  Test 
Questionnaire  for  specific  groups*  that  may  experience  particular  problems  with  the  proposed 
questions. 

If  there  is  to  be  a  Special  Population  component  in  future  Census  tests  then  careful  attention 
is  required  in  the  preparation  of  the  "F03"  files  needed  by  LFS  (John  Rowland)  as  input  to 
the  "S03"  print  files  needed  by  ISD  (Kathy  Reid/  Dave  Bowman).   Extra  processing  was 
required  to  reformat  Special  Population  records  into  an  "F03"  structured  response  file.  This 
file  was  required  by  the  LFS  to  create  an  "SOS"  print  file  for  the  production  of  NCT  labels 
and  control  lists.   For  the  two  M6tis  samples  selected  from  membership  association  lists  we 
had  to  make  up  dununy  Prov-FED-EA-Hhld  numbers.   The  proper  Province  code  was 
assigned,  '000000'  for  FED-EA  and  then  Hhld  number  was  sequentially  assigned 
commencing  at  '001'.   Group  and  Rotation  were  recoded  to  zeros.   The  LFS  "F03  Short" 
record  length  is  280  bytes  but  the  Special  Population  mock  F03  was  created  as  131  bytes. 
The  LFS  F03  file  expects  a  record  length  of  372  so  the  Special  Population  file  was  zero- 
filled  to  meet  this  requirement. 

Two  problems  arose  with  the  Special  Populations.   The  F03  interviewer  assignment  numbers 
(lANs)  were  not  moved  over  correctiy  and  consequentiy  did  not  match  Dave  Bowman's  file. 
This  was  resolved  in  the  program  that  merged  the  F03  and  R21  (assignment  planning)  files. 
The  LFS  system  required  a  current  IAN  as  well  as  a  previous  IAN  to  be  placed  on  the  F03 
file  in  two  consecutive  places.    Secondly,  it  was  decided  that  Edmonton  was  to  handle  the 
assignment  planning  for  Winnipeg.  This  complicated  the  print  file  prq)aration  so  that 
Edmonton  could  print  Winnipeg's  labels  by  requiring  a  matching  on  PSU,  Listing  number 
and  last  byte  of  IAN  for  RO  16  and  RO  17  data. 

A  few  days  were  lost  during  production  to  produce  a  print  file.  Some  more  fields  on  the  372- 
byte  F03  were  identified  that  should  have  specific  values*  (see  footnote  below  for  future 
reference). 


^Blacks  in  Halifax,  Asians  in  Montreal,  Blacks  in  Montr^,  Latin  Americans  in 
Montr^,  Asians  in  Toronto,  Blacks  in  Toronto,  Aboriginals  in  Winnipeg,  M6tis  in 
Winnipeg,  M^tis  in  Saskatoon,  Aboriginals  in  Regina,  Aboriginals  in  Edmonton  and  Asians 
in  Vancouver 

'  survey  id  (pos.25,1)  =  1 

preprinted  code  (pos.52,l)  =  l  (=0  if  special  populations) 

pos.  54,8  =  blank  (flags  based  on  a  previous  month  -  make  special  populations  look  like 
births) 

pos.  280,1  =  0  (there  are  no  notes) 

pos.  266  =  1  (if  flagged  for  EFS  or  blank  on  special  populations) 


Data  Capture 

The  start  of  EFS  data  capture  was  delayed  three  or  four  days  mainly  because  of  bugs  in  the 
DC2  software  being  used  for  the  NCT.   Due  to  time  constraints  Special  Surveys  did  not  have 
an  opportunity  to  review  keyer  instnictions.    One  instraction  called  for  household  data  (Steps 
1  -  7  and  QQ47  onwards)  to  be  captured  only  once  as  a  part  of  the  person-l  record.   In 
several  cases,  household  data  were  captured  for  somebody  other  than  person  1*.   An 
ambiguity  in  the  instructions  may  have  been  the  reason.  It  would  have  been  desirable  to  have 
reviewed  and  commented  on  the  instructions. 

In  using  DC2,  there  was  no  means  of  control  to  guarantee  all  household  members  had  been 
captured.   DC2  was  capable  of  controlling  for  the  number  of  forms  but  there  was  no  way  of 
knowing  whether  a  key  operator  had  missed  out  an  entire  person  from  the  form.   A  higher 
level  of  verification  than  was  used  would  be  helpful. 

The  design  of  the  questionnaire  was  set  up  for  vertical  capture  to  reflect  the  capturing  of 
person  records  within  the  household.   No  colour  distinction  was  made  on  the  form  to  assist 
the  key  operator  visually  to  stay  in  the  correct  column  on  each  page.   A  recommendation  for 
shading  should  be  made  if  the  budget  can  handle  the  additional  cost  for  printing. 

Another  limitation  of  the  DC2  software  was  that  it  lacked  verification  flexibility.   For 
example,  100%  of  a  given  field  had  to  be  verified  by  re-keying  for  all  forms.  It  was  not 
possible  to  verify  a  sample  of  forms.  It  would  have  been  preferable  to  have  taken  a 
subsample  of  documents  and  verified  on  several  or  all  fields.   If  this  option  had  been 
available  a  better  picture  of  the  error  rate  incurred  by  the  key  operator  would  have  been 
evident.  For  budget  reasons  we  chose  to  verify  the  20-byte  Household  identifier  and 
questions  2-5  for  every  form.    We  would  advise  with  hindsight  a  higher  level  of 
verification. 

Processing 

Since  a  major  purpose  of  the  test  was  to  measure  errors,  the  capture  program  was  written 
and  costed  to  jJlow  for  multiple  entries  for  all  precoded  questions,  including  those  with 
instructions  "mark  one  only".   It  is  questionable  whether  subject  matter  people  were 
interested  enough  in  multiple-response  errors  to  make  this  worth  while.   Developing  and 
testing  the  data  capture  entry  system  took  many  person-days  more  than  if  Special  Surveys' 
standard  processing  practice  had  been  used,  of  capturing  the  first  response  where  only  one  is 
expected.  Future  tests  might  also  use  RO  facilities  for  grooming  before  capture. 

A  shortage  of  resources  during  the  processing  was  experienced.   Planners  and  managers  of 
any  future  Census  test  will  want  to  be  assured  of  having  adequate,  qualified  programmers  to 


*This  occured  primarily  in  the  Montr6al  RO. 
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handle  complex  programming  requirements'.   To  complete  the  testing  and  production  runs 
for  the  NCT  three  experienced  programmers  were  found  at  short  notice  in  January  1994, 
each  available  for  a  few  weeks  only.   Each  programmer  was  assigned  his  own  tasks.   What 
was  being  tested  or  produced  and  by  whom  required  hour-to-hour  attention  and  direction 
from  a  coordinator*. 

Attention  needs  to  be  paid  to  the  creation  and  availability  of  detailed  test  flies.   During  the 
phase  of  program  development  for  derived  variables  programmers  wanted  good  test  fUes. 
Because  of  the  nature  of  the  DVs  these  test  files  are  best  supplied  by  subject  matter  persons 
who  are  responsible  for  specifications  for  the  DVs.   Early  advisement  should  be  given  to 
subject  matter  in  order  that  they  can  prepare  for  this  part  of  the  processing. 

Working  on  the  same  platform  would  have  been  advantageous  in  terms  of  location  and 
management  of  file  creation.   For  example,  all  of  the  processing  could  have  been  handled  on 
our  own  LAN,  or  on  the  Census  LAN  (UNIX)  or  on  the  Mainframe.   In  actuality,  there  was 
a  lot  of  uploading  and  downloading  of  production  files  in  order  to  accomodate  the  two 
working  environments  used  for  the  1993  NCT*.   From  the  creation  of  the  RAW  files  to  the 
PREDIT  files  the  processing  was  handled  on  the  Mainframe.   Afterwards  the  processing  was 
all  done  on  the  UNIX,  except  for  one  of  the  short-term  borrowed  programmers  who 
preferred  to  work  on  the  mainframe.   At  times  the  Unix  was  a  bit  slow  due  to  maximum 
user  capacity  or  space  limitations.   Presumably  there  are  cost  savings  in  using  a  LAN  such  as 
the  Census  Unix,  although  it  might  be  difficult  to  determine  just  how  much. 

Other  than  for  income  data  (Q.46  in  the  test)  there  was  no  editing  of  results  by  Special 
Surveys  after  capture.   For  future  tests,  from  a  capture  point  of  view,  consideration  should 
be  given  to  omitting  the  income-question  cents  boxes,  or  to  retaining  the  background  colour 
in  them  (i.e.,  not  dropping  them  out).   They  added  to  the  incidence  of  error  despite  key 
operators  being  instructed  not  to  capture  the  cents.   Perhaps  income-question  instructions 
could  be  tested:  "enter  the  amount  to  the  nearest  dollar".   If  future  tests  include  editing  the 
income  data,  verification  on  this  field  should  be  included,  and  there  should  be  more  edits 
included  in  the  capture  system.   About  four  programmer-weeks  were  used  in  the  test  just  to 
edit  Q46  which  became  an  expensive  and  time-consuming  aspect  of  overall  programming. 

To  satisfy  LFS  weight-correction  procedures,  every  record  in  the  main  LFS-sample-based 
results  file  had  to  have  an  age  value.   Year  of  birth  was  imputed  where  missing,  then  age 


'The  NCT  Operational  team  in  Special  Surveys  consisted  of  PWl  Stevens  -Manager, 
Neelam  Prakash  -  Programmer  (replacement  for  Mike  Egan  in  September,  1993),  and  Lorie 
Shinder  -  Processing  Rep. 

'Evelyn  Ryan  of  Census  Operations  controlled  and  monitored  the  assignment  of  tasks  of 
the  three  programmers. 

^Processing  was  developed  and  run  on  both  the  Mainframe  as  well  as  the  UNIX. 
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derived  from  it.   Age  was  then  copied  to  the  EFS  file.    The  EFS  sample  was  a  subset  of  the 
NCT  LFS-based  sample,  with  data  for  each  person  captured  twice  -  once  before  field  edits 
and  follow-up  in  the  EFS  file,  then  after  edits  and  follow-up  in  the  NCT  file.  However, 
some  records  in  the  EFS  file  were  not  captured  in  the  main  NCT  file,  because  not  returned 
in  time  for  NCT  capture,  or  lost  in  the  mail.  In  the  step  of  copying  age  to  the  EFS  file  from 
the  NCT  file,  an  "unknown"  value  was  given  to  EFS  records  not  in  the  NCT  file.   Some  of 
these  EFS  records  nevertheless  had  valid  year-of-birth  data.  It  might  have  been  desirable  to 
allow  for  separate  derivation  of  age  for  these  records.    If  future  tests  follow  the  same 
methodology  for  an  EFS  component,  this  step  should  be  taken  into  consideration. 

A  detailed  plan  of  processing  steps  needs  to  be  drafted  as  weU  as  documented  to  provide  an 
overview  of  the  tasks  at  hand  and  in  the  proper  sequence.      For  example,  after  the  creation 
of  many  of  the  derived  variables  the  NCT  team  became  aware  that  the  temporary  and  foreign 
residents  (i.e.,  those  checking  Step  4  or  Step  6  of  the  questionnaire)  should  have  been 
dropped  from  the  NCT  and  EFS  files  at  earlier  stages  of  processing,  when  duplicate  and 
empty  records  were  dropped.   This  also  impacted  on  the  coding  because  the  write-ins  had 
been  split  off  and  sent  for  coding  before  temporary  and  foreign  residents  were  dropped.   At 
the  time  of  code  linkage  we  had  more  codes  than  we  could  initially  account  for,  until  we 
recalled  that  the  extras  were  from  the  dropped  temporary  and  foreign  residents.    Up-to-date 
documentation  is  important^". 


Autocoding 

An  unanticipated  component  was  the  request  by  subject  matter  to  allow  for  more  than  one 
code  to  be  returned  for  a  multiple  response.    One  characteristic  of  the  ACTR  system  is  that  it 
can  only  provide  one  code  per  write-in.   Multiple  responses  for  Ethnic  origin  (Q16)  and 
Language  (Q09  and  QU)  were  resolved  manually  by  subject  matter.   Using  SAS,  Special 
Surveys  produced  hardcopy  to  assist  with  the  manual  resolution.   It  contained  additional 
information  from  other  questions  for  the  respondent  as  well  as  write-ins  from  other  members 
of  the  household. 

It  would  be  helpful  if  Subject  Matter  people  were  to  decide  in  advance  if  they  wanted  to 
include  more  than  one  code  from  write-ins,  and  if  so  the  maximum  number  of  entries,  and 
how  to  handle  situations  which  exceed  that  limit.  In  addition,  they  could  specify  their 
requirements  for  additional  information  to  aid  coding  early  in  the  planning  for  coding  and 
whether  or  not  a  hardcopy  or  machine  readable  format  is  acceptable. 

Resolution  of  multiple  response  might  be  handled  better  by  assigning  a  unique  code  for  a 
given  multiple  during  initial  coding  and  then  resolving  these  codes  after  the  linkage 
processing.   The  reasons  would  be  twofold:   first.  Special  Surveys  could  accurately  verify 
code  linkage  back  to  the  NCT  and  EFS  files  and  second.  Subject  Matter  would  have  an 


^"Attached  are  the  NCT  and  EFS  processing  flowcharts. 
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opportunity  to  look  at  all  codes  before  deciding  the  appropriate  course  of  action  (e.g., 
whether  to  recode  to  more  than  one  code  or  hot). 

The  multiple  response  problem  will  continue  to  be  problematic  with  the  use  of  open-ended 
questions.   If  this  style  of  questioning  is  used  for  the  1996  Census  or  the  next  NCT  further 
system  development  will  be  required  to  handle  the  resolution  of  these  cases. 


APPENDIX 


Processing  Flow 
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EFS     DATAFLOW 


UNIX  PROCESSING 


EFS  RAW  FILE 

EFS  BRANCH 

STEP  01: 

SAS  PROGRAM:  CAPS/COD/PROO/EFS/VLDPID.PGM 

SAS  LOG:  CAPS/COO/PROD/EFS/VLDPID.LOG 

SAS  OUTPUT:  CAPS/COO/PROD/EFS/VLDPID.OUT 

SAS  CARD ( S ) : 

INPUT  FILE(S):  CAPS/C0D/PR00/EFS/RAU1728. COMPLETE. DAT. 00. DECOI     (10,145) 

OUTPUT  FILE(S):  CAPS/COO/PROO/EFS/VLDPID. RAU1728.DAT. 10. 0EC10      (10,090) 


STEP  02: 


SAS  PROGRAM:  CAPS/COO/PROD/EFS/RAUNDU.PGM 

SAS  LOG:  CAPS/COO/PROO/EFS/RAUNDU.LOG 

SAS  OUTPUT:  CAPS/COD/PROO/EFS/RAUNDU.OUT 

SAS  CARD(S): 

INPUT  FILE(S):  CAPS/COD/PROD/EFS/VLDPID. RAU1728.DAT. 10. DEC10 

OUTPUT  FILE(S):  CAPS/COO/PROO/EFS/RAWNDU. RAW1728.DAT. 11 .MARIO 


(10,090) 
(10,087) 


STEP  03: 


SAS  PROGRAM:  CAPS/COO/PR0O/EFS/NONTEMP2.PGM 

SAS  LOG:  CAPS/C0O/PR0O/EFS/NONTEMP2.LOG 

SAS  OUTPUT:  CAPS/C0D/PR00/EFS/N0NTEMP2.0UT 

SAS  CARD(S): 

INPUT  FILE(S):  CAPS/COO/PROO/EFS/RAWNDU. RAU1728.DAT. 11, MARIO 

OUTPUT  FILE(S):  CAPS/COO/PROO/EFS/NONTEMP. RAW1728.DAT. 03. MAR16 


(10,090) 
(10,087) 


EFS  BRANCH 
MAINFRAME  PROCESSING: 


FIRST  PROGRAM  (STEP  1):  SPEC.ICT9311.SRCECSCRT) 

INPUT  FILE(S):  SPEC.ICT9311.RAUPH1.ESSDEC01 

OUTPUT  FILE(S):  SPEC. ICT9311.RAWPH1.ESSDEC01. SORTED 
Irecl:  1728,  blksize:8640 


#  RECORDS:  10,145 

#  RECORDS:  10,145 
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(STEP  21:  SPEC.ICT9311.SRCE(DUPSEQ2) 

INPUT  FILECS):  SPEC. ICT9311.RAUPH1.ESSDEC01. SORTED 

OUTPUT  FILE(S):  SPEC.ICT9311.RAWPH1.DUPS 

:  SPEC. ICT9311.RAWPH1. UNIQUE 
Irecl:  1728,  blksize:8640 


#  RECORDS:  10,145 

#  RECORDS:  0 

#  RECORDS:  10,145 


(Program  is  checking  &  producing  output  file  with  duplicate  records) 


SECOND  PKOGRWt;     SI»EC.ICT9311.SRCE(FSPLIT) 

INPUT  FILE(S):  SPEC. ICT9311 .RAWPH1 .ESSDEC01  * 

•  DOUNLCNDED  TO  UNIX:      /PR0O/EFS/liAU1728.C0MPLETE.DAT.00.DEC01 

OUTPUT  FILE(S):  SPEC.ICT9311.RAWESS.PERSID00 

:  SPEC.ICT9311.RAUESS.VLDPERID  * 
Irecl:  1728,  blksize:8640 

*  DOIMLOHDED  TO  UNIX:  /PR0D/EFS/VLDPID.IiAU1728.DAT.01.0EC10 


(Program  is  spliting  records  with  PERSONID  =  '00'  ) 


PREDIT  PROGRAH;  FINAL  FILE  BEFORE  BOIMG  INTO  PREDIT 


#  RECORDS:   10,145 

#  RECORDS:  55 

#  RECORDS:  10,090 


THIRD  PROGRAM:  SPEC.ICT9311.SRCECC0MPEDITI 

INPUT  FILE(S):  SPEC.ICT9311 .RAWESS.VLDPERID 

OUTPUT  FILE(S):  SPEC. I CT9311.EFS. PREDIT. NONTEXT 

Irecl  =  92,  blksize  =  8832 

:  SPEC.ICT9311.EFS.PREDIT.TEXT 
Irecl  =  532,  blksize  =  8512 

(PL1  program  is  going  through  the  PREDIT  step  and  creating  2  output  files; 
(1)...TEXT  file,  Irecl  =92,  blksize  =  8832 
(2)... NON-TEXT  file,  Irecl  =  532,  blksize  =  8512 


#  RECORDS:  10,090 

#  RECORDS:  10,090 

#  RECORDS:   9,551 


FILE  BEIWG  PREPARED  FOB  AUTOCODIHG: 


SUBSEQUENT  PROGRAHCSl  TO  CREATE  TEXT  FILES  FOR  AUTOCODING 

PROGRAH  1:  SPEC.ICT9311.SRCE(ALLTEXT) 

INPUT  FILE(S):  SPEC.ICT9311.RAWPH1.ESSDEC01 
IRECL:  1728,  blksize  =  8640 

OUTPUT  FILE(S):  SPEC.ICT9311.PF.RAWPH1.ALLTEXT 
LRECL:  794,  blsize  =  8734 


#  RECORDS:  10,145 


#  RECORDS:  10,145 


(Program  is  picking  up  all  fields  where  text  is  and  producing  1  big  text  file  of  794  bytes  which  will  be 
split  in  subsequent  program) 


PROGRAH  2  :  SPEC.ICT9311.SRCE(SMALLTXT) 
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STEP  A:  SORT 

INPUT  FILE: 


SPEC.ICT9311.PF.RAWPH1.ALLTEXT 
LRECL:  794 


#  RECORDS:  10,145 


72-BYTE  OUTPUT  FILE(S): 


SPEC, 
SPEC 
SPEC 
SPEC 
SPEC 
SPEC 
SPEC 
SPEC 
SPEC 
SPEC 
SPEC 
SPEC 
SPEC 
SPEC 
SPEC 


ICT9311 
ICT9311 
ICT9311 
ICT9311 
ICT9311 
ICT9311 
ICT9311 
ICT9311 
ICT9311 
ICT9311 
ICT9311 
ICT9311 

,ICT9311 
ICT9311 

,ICT9311 


.PF.RAUPHI 
.PF.RAUPH1 
.PF.RAWPH1 
.PF.RAUPHI 
.PF.RAWPH1 
.PF.RAUPHI 
.PF.RAUPHI 
.PF.RAUPHI 
.PF.RAUPHI 
.PF.RAUPHI 
.PF.RAUPHI 
.PF.RAUPHI 
.PF.RAUPHI 
.PF.RAUPHI 
.PF.RAUPHI 


.Q09TX1 

.Q09TX2 

.Q09TX3 

.Q11TXT 

.Q12TXT 

.Q13TXT 

.Q16TXT 

.Q18TX1 

.Q16TX2 

.Q16TX3 

.Q19TXT 

.Q24TXT 

.Q41TXT 

.Q22TXT0C 

.Q22TXTIC 


97-BYTE  OUTPUT  FILE: 
SPEC.ICT9311.PF.RAUPH1.Q22TXTIC 


163-BYTE  OUTPUT  FILE: 
SPEC. ICT931 1 .PF.RAUPHI .Q42TXT 

#  RECORDS  ON  EACH  OUTPUT  FILE:  10,145 


NOTE:   Each  of  the  above  filet  created,  it  tplit  into  two  filw;  non-blank  &  blanlci;   non-blanlo  are  lant  for 
autocodinn. 


STEP  B1:  SPLIT  FOR  009  TEXT  1  (KNOULEDGE  OF  LANGUAGE) 

INPUT  FILE(S):  SPEC.ICT9311.PF.RAUPH1.Q09TX1 
LRECL:  72,  BLKSIZE:  8856 

OUTPUT  FILE(S):  SPEC. ICT9311. PF.RAUPHI .Q09TX1.N0NBLIC  • 
LRECL:  72 
*  FILE  SENT  FOR  AUTOCOOIHG 

:  SPEC. ICT931 1 .RAUPH1 .Q09TX1 .BLANKS 


#  RECORDS:  10,145 


#  RECORDS:  902 


#  RECORDS:     9,243 


STEP  B2:     SPLIT  FOR  009  TEXT  2  (iOIOULEDGE  OF  LANGUAGE) 

INPUT  FILE<S):     SPEC.ICT9311.PF.RAUPH1 .Q09TX2 
LRECL:     72 

:     SPEC.ICT9311.PF.RAUPH1.009TX2.NaNBLIC  * 

LRECL:  72 
*  FILE  SENT  FOR  AUTOCODING 


#  RECORDS:  10,145 


#  RECORDS:  102 
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SPEC. ICT9311 .RAWPH1 .Q09TX2. BLANKS 


#  RECORDS:  10,043 


STEP  B3:  SPLIT  FOR  Q09  TEXT  3  (KNOULEDGE  OF  LANGUAGE) 

INPUT  FILE(S):  SPEC.ICT9311.PF.RAWPH1.Q09TX3 
LRECL:  72 

SPEC.ICT9311.PF.RAUPH1.a09TX3.NaNBLK  * 
LRECL:  72 
*  FILE  SENT  FOR  AUTOCCDING 

:  SPEC. ICT9311 .RAUPH1 .Q09TX3. BLANKS 


#  RECORDS:  10,145 


#  RECORDS:   24 


#  RECORDS:  10,121 


STEP  C:  SPLIT  FOR  Q11  TEXT  (NOTHER  TONGUE) 

INPUT  FILE(S):  SPEC.ICT9311.PF.RAWPH1.Q11TXT 
LRECL:  72 

OUTPUT  FILE(S):     SPEC.ICT9311.PF.RAUPH1.a11TXT.N0NBLK  • 
LRECL:     72 
*     FILE  SENT  FOR  AIITOCODING 

:  SPEC. ICT9311.RAUPH1.Q11TXT. BLANKS 


#  RECORDS:  10,145 


#  RECORDS:  883 


#   RECORDS:  9,262 


STEP  D:  SPLIT  FOR  012  TEXT  (PLACE  OF  BIRTH) 

INPUT  FILE(S):  SPEC.ICT9311.PF.RAWPH1.Q12TXT 
LRECL:  72 

OUTPUT  FILE(S):     SPEC.ICT9311.PF.RAWPH1.Q12TXT.N0NBLK  • 
LRECL:     72 
*     FILE  SENT  FOR  AIITOCODING 

:  SPEC. ICT9311 .RAWPH1 .Q12TXT. BLANKS 


#  RECORDS:  10,145 

#  RECORDS:  950 

#  RECORDS:  9,195 


STEP  E:  SPLIT  FOR  Q13  TEXT  (CITIZENSHIP) 

INPUT  FILE(S):  SPEC.ICT9311.PF.RAWPH1.Q13TXT 
LRECL:  72 

OUTPUT  FILE(S):  SPEC.ICT9311.PF.RAUPH1.Q13TXT.N0NBLIC  * 
LRECL:  72 
•  FILE  SENT  FOR  AUTOCODING 

:  SPEC. ICT931 1 .RAUPH1 .Q13TXT. BLANKS 


#  RECORDS:  10,145 

#  RECORDS:  299 

#  RECORDS:  9,846 


STEP  F1:  SPLIT  FOR  016  TEXT  1  (ETHNIC  ORIGIN) 

INPUT  FILECS):  SPEC.ICT9311 .PF.RAUPH1.Q16TX1 
LRECL:  72 

OUTPUT  FILECS):     SPEC.ICT9311.PF.RAUPH1.Q16TX1.N0NBLIC  * 
LRECL:     72 
*     FILE  SENT  FOR  AUTOCODING 

:  SPEC. ICT9311.RAWPH1.Q16TX1. BLANKS 


#  RECORDS:  10,145 

#  RECORDS:  8,838 

#  RECORDS:   1,307 
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STEP  F2:     SPLIT  FOR  Q16  TEXT  2  (ETHNIC  ORIGIN) 

INPUT  FILE(S):     SPEC.ICT9311.PF.RAUPH1.016TX2 
LRECL:     72 

OUTPUT  FILE(S):     SPEC.ICT9311.PF.RAIJPH1.Q16TX2.N0NBIJC  * 
LRECL:     72 
•     FILE  SENT  FOR  AUTOCODING 

:  SPEC. ICT9311.RAWPH1.Q16TX2. BLANKS 


#  RECORDS:      10,  US 

#  RECORDS:     3,262 

#  RECORDS:     6,883 


STEP  F3:  SPLIT  FOR  016  TEXT  3  (ETHNIC  ORIGIN) 

INPUT  FILE(S):  SPEC.ICT9311.PF.RAUPH1.Q16TX3 
LRECL:  72 


#  RECORDS:  10,145 


OUTPUT  FILE(S}:  SPEC.ICT9311.PF.RAUPH1.Ql6TX3.NqNBLIC 
LRECL:  72 
•  FILE  SENT  FOR  AUTOCODING 

:  SPEC. ICT931 1 .RAUPH1 .Q16TX3. BLANKS 


#  RECORDS:  1,334 


#  RECORDS:  8,811 


STEP  G:  SPLIT  FOR  018  TEXT  (RACE) 

INPUT  FILE(S):  SPEC.ICT9311.PF.RAWPH1.Q18TXT 
LRECL:  72 

OUTPUT  FILE(S}:  SPEC.ICT9311.PF.RAyPH1.018TXT.NaNBLK  • 
LRECL:  72 
*  FILE  SENT  FOR  AUTOCODING 

:  SPEC. ICT9311 .RAWPH1 .Q18TXT. BLANKS 


#  RECORDS:  10,145 

#  RECORDS:  98 

#  RECORDS:  10,047 


STEP  H:  SPLIT  FOR  019  TEXT  (INDIAN  BAND) 

INPUT  FILE(S):  SPEC.ICT9311.PF.RAWPH1.Q19TXT 
LRECL:  72 

OUTPUT  FILE(S):  SPEC.ICT9311.PF.RAUPH1.019TXT.NaNBLIC  • 
LRECL:  72 
*  FILE  SENT  FOR  AUTOCODING 

:  SPEC. ICT9311.RAWPH1.Q19TXT. BLANKS 


#  RECORDS:  10,145 

#  RECORDS:  46 

#  RECORDS:  10,099 


STEP  I:  SPLIT  FOR  024  TEXT  (LANGUAGE  OF  EDUCATION) 

INPUT  FILE(S):  SPEC.ICT9311.PF.RAWPH1 .Q24TXT 
LRECL:  72 

OUTPUT  FILE(S):  SPEC.ICT9311.PF.RAyPH1.024TXT.N0NBLK  * 
LRECL:  72 


#  RECORDS:  10,145 


#  RECORDS:  59 
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•  FILE  SENT  FOR  AUTOCODING 

:  SPEC . I CT931 1 . RAWPH 1 . Q24TXT . BLANKS 


#  RECORDS:  10,086 


STEP  J:  SPLIT  FOR  Q41  TEXT  (LANGUMGE  OF  UORK) 

INPUT  FILE(S):  SPEC.ICT9311 .PF.RAWPH1 .Q41TXT 
LRECL:  72 

OUTPUT  FILE(S):     SPEC.ICT9311.PF.RAUPH1.Q41TXT.N0NBLIC  * 
LRECL:     72 
•     FILE  SENT  FOR  MITOCODING 

:  SPEC. ICT9311.RAWPH1.Q41TXT. BLANKS 


#  RECORDS:   10,145 


#  RECORDS:  78 


#  RECORDS:  10,067 


STEP  K:  SPLIT  FOR  022  TEXT  (NOBILITY  OUTSIDE  CANADA) 

INPUT  FILE(S):  SPEC. I CT9311.PF. RAWPH 1.Q22TXTOC 
LRECL:  72 

OUTPUT  FILE(S):     SPEC.ICT9311.PF.RAIIPH1.a2ZTXT0C.N0NBLIC  • 
LRECL:     72 
*     FILE  SENT  FOR  AUTOCODING 

:  SPEC . I CT93 1 1 . RAWPH 1 . Q22TXTOC . BLANKS 


#  RECORDS:  10,145 


#  RECORDS:  179 


#  RECORDS:  9,966 


STEP  L:  SPLIT  FOR  022  TEXT  (NOBILITY  INSIDE  CANADA) 

INPUT  FILE(S):  SPEC.ICT9311.PF.RAWPH1.Q22TXTIC 
LRECL:  97 

OUTPUT  FILE(S):  SPEC.ICT9311.PF.RAUPH1.022TXTIC.NaNBLK  * 
LRECL:  97 
•  FILE  SENT  FOR  AUTOCODING 

:  SPEC. I CT9311. RAWPH 1.Q22TXT I C. BLANKS 


#  RECORDS:  10,145 

#  RECORDS:  1,540 

#  RECORDS:  8,605 


STEP  N:  SPLIT  FOR  042  TEXT  (PLACE  OF  UORIC) 

INPUT  FILE(S):  SPEC.ICT9311.PF.RAWPH1.Q42TXT 
LRECL:  97 

OUTPUT  FILECS):  SPEC.ICT9311.PF.RAUPH1.042TXT.N0NBLK  * 
LRECL:  163 
•  FILE  SENT  FOR  AUTOCODING  BUT 

DUE  TO  SHORTAGE  OF  RESOURCES  PLACE  OF  UORK  UNIT 
IMS  UNABLE  TO  PERFORN  NANUAL/INTERACTIVE  CODING 

:  SPEC . I CT93 1 1 . RAWPH 1 . Q42TXT . BLANKS 


#  RECORDS:  10,145 


#  RECORDS:  4,058 


#  RECORDS:  6,087 


NAINFRANE  FILES  RETURNED  FRON  AUTOCODING: 
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SPEC. ICT9311.NCTAC.LANQ09. FINAL. ESS  --  1,028  records 
SPEC. I CT9311.NCTAC.LANQ11. FINAL. ESS  ---  883  records 
SPEC. ICT9311.NCTAC.PCTQ12. FINAL. ESS  ---  950  records 
SPEC. ICT9311.NCTAC.PCTQ13. FINAL. ESS  ---  299  records 

SPEC. ICT9311.ET0Q16. FINAL. ESS  13,434  records 

SPEC. ICT9311.ET0Q18. FINAL. ESS 98  records 

SPEC. ICT9311.INDQ19. FINAL. ESS  46  records 

SPEC. ICT9311.M5PQ22T1. FINAL. ESS  1,540  records 

SPEC. ICT9311.M5PQ22T2. FINAL. ESS  179  records 

SPEC. I CT9311.LANQ24. FINAL. ESS  59  records 

SPEC. I CT9311.LANQ41. FINAL. ESS  78  records 


UBIIX  PROCESSING  FOB  EFS  AFTER  PREDIT 

■SPEC.ICT9311.EFS.PREDIT.NaNTEXT" 

DOWNLOADED  TO  UNIX 

"CAPS/C0D/OLDPRGD/EFS/EXCLO46.DAT.FEB01" 


UNIX  PROCESSING  (in  SAS) 


EFS   FLOW  OF  FILES  AFTER  PREDIT 


INPUT:   ../../OLDPROD/EFS/PREDIT.EXCLQ46.DAT.FEB01  (10,087) 


Pm  Function; 

linking  codes  received 
from  auto-coding  including 
ethnic  and  language  manual 
resolution. 


CODLNK.PGM 
C0DLNK.LOG 
CODLNK.OUT 


OUTPUT:   ../../PROO/EFS/CODLNK. DVS953.DAT. 00. HAR02  (10,087) 


INPUT: 


./../PROD/EFS/COOLNK. DVS953.DAT. 00. MAR02  (10,087) 


Pga  function: 

eliminating  Personid  >01 
that  were  temporary 
residents  and  missed 
in  the  earlier  stages. 


NONTEMP.PGM 
N0NTEHP.LOG 
NONTEHP.OUT 


OUTPUT:  ../../PROO/EFS/NONTEHP. DVS953.DAT. 01 .MAR16  (9,986) 
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INPUT: 


. / . . /PROO/E  FS/NONTEMP . DVS953 . DAT . 0 1 . MAR 1 6  ( 9 . 986 ) 


Pai  Function: 

Fixing  Q46(inconie  question) 
Program  written  by  Amir  Ranjbar 
and  also  establishing  Multiple 
Error  Flags  for  all  questions 
that  had  multiple  responses. 


Q46MEF.PGM.MAR18 
Q46MEF.LOG. MAR 18 
Q46MEF.0UT.MAR18 


OUTPUT: 


./../PR00/EFS/Q46MEF. DVS953.DAT. 02. MARIS  (9,986) 


INPUT:   ../../PROD/EFS/Q46MEF. DVS953.DAT. 02. MARIS  (9,986) 

+ 
../../PROD/NCT/OVLANG. DVS953.DAT. 02. FEB28  (40,915) 


Pai  Function; 

Loading  date  of  birth,  sex 
DVAGE2  from  NCT  to  EFS  when 
they  are  blank,  for  they 
were  imputed  on  the  NCT  file. 


LNKBIR. PGM. MARIS 
LNKBIR.LOG. MARIS 
LNKBIR. OUT. MARIS 


OUTPUT:  ../../PROO/EFS/LNKBIR. DVS953.DAT. 03. MARIS  (40,662) 


../../PR00/EFS/LNKBIR.DVS953. OAT. 03. MARIS  (9,986) 


Pm  Figiction: 

Creating  derived  variables 
for  language  questions. 


DVLANG. PGM. MARIS 
DVLANG.LOG. MARIS 
DVLANG. OUT. MARIS 


.  / . .  /PROD/E  FS/D VLANG .  D VS953 .  DAT .  04 .  MAR  1 8  ( 9 ,  986 ) 


../../PROD/EFS/DVLANG.DVS953.0AT.04.MAR1S  (9,986) 


Pqw  figicticn: 

Recoding  Assignment  #  for 
certain  Rotation  Group  and 
recoding  PSU  #  "35087" 
instead  of  "85087"  if 
Rotation  Group='0' 


ASSPSU.PGM.MAR21 
ASSPSU.L0G.MAR21 
ASSPSU.0UT.MAR21 


. ./. ./PROD/EFS/ASSPSU. DVS953.DAT. 05. MARIS  (9,986) 


. ./. ./PROO/E FS/ ASSPSU.DVS953.DAT. 05. MAR 18  (9,986) 

PiW  Fwiction; 

LBFORCE.PGM.MAR30 
LBFORCE.LOG.MAR30 
LBFORCE.OUT.MAR30 


./. ./PROO/EFS/LBFRCEDVS. DVS953.DAT. 06. MAR28  (9,986) 
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../../PROO/EFS/LBFRCEDVS. DVS953.DAT. 06. MAR28  (9,986) 
Pw  FiMiction:  | 


I   (  file  produced  by  Norm  Crainpton  on  mainframe,   to  create  ethnic  DVs) 

V 


../../PROO/EFS/ETHDVS. DVS953.DAT. 07. APR25  (9,986) 


. ./. , /PROO/EFS/ETHDVS. 0VS953. DAT. 07.APR25  (9,986) 


FIXQ46.PGM.HAY06 
FIXQ46.LOG.MAY06 
FIXQ46.0UT.MAY06 


Pgi  Faction; 

Picking  a46AAmt  from  the  raw 
file  (length  20  bytes),  for 
in  the  Predit  only  19  bytes  of 
Q46AAmt  were  picked  up  and  the 
20th  byte  was  garbage(just  for 
Q46AAMT)  then  ran  Q46HEF.PGM 
again,  against  it. 


./../PR00/EFS/FIXQ46. DVS953.DAT. 08. MAY06  (9,986) 


./. ./PR0D/EFS/FIX046. DVS953.DAT. 08. MAY06  (9,986) 


101 


Pgw  FMiction: 

Rerun ing  046MEF.PGM 

for  Q46AAint  only  and  for 

related  multiple  flags. 


Q46MEF.REV.PGM 
Q46HEF.REV.LOG 
Q46HEF.REV.0UT 


../../PR0D/EFS/Q46MEF. DVS953.DAT. 09. MAY06  (9,986) 


./../PROO/EFS/Q46f1E F.DVS953.DAT. 09. MAY06  (9,986) 
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Pm  Function; 

Creating  income  derived 
variables:  wages,  investment 
UIC,  etc 


INCOME. PGM. MAY 11 
I NC0ME.LOG. MAY 11 
INCOME. OUT. MAY 11 


./../PROD/EFS/INCOME. DVS953.DAT. 10. MAY11  (9,986) 


../../PROD/EFS/INCOME. DVS953.DAT. 10. MAY11  (9,986) 


Pai  Function; 

Fixing  language  DVs  and  adding 
121   the  AGEGP  and  SEX  variables 

and  making  sure  SEXFLG  has  blanks 
instead  of  '.'  (fixed  by  Phil) 


FIX.DVLANG.PGM 
FIX.DVLANG.LOG 
FIX.DVLANG.OUT 


. . / . . /PRCO/E  FS/RE VDVLANG . D VS953 . DAT . 1 1 . MAY 1 1  ( 9 , 986 ) 
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../../PROD/EFS/REVDVLANG. DVS953.DAT. 11. MAY 11   (9,986) 


Pm  Function; 

recoding  HLDSIZE 


MIDSIZE. PGM 
HLDSIZE.LOG 

from  '  1'  '  2'  etc.  v  HLDSIZE. OUT 

to  '01'  '02'  etc., 

WAGES  &  TOT  INC 


13! 


../.. /PROD/EFS/HLDSIZE. DVS953.DAT. 12. MAY13  (9,986) 

mVED  TO 
../../OLDPROD/EFS/HLDSIZE. DVS953.DAT. 12. MAY13.Z   (9,986) 

REMMED  TO 
. ./. ./PR0D/EFS/INC0ME.DVS953.DAT.08.APR0S     (9,986) 

UPLOADED  TO 
SPEC.ICT9311.EFS.APR14.PUBREAD  (9,986) 


14! 


. ./. ./PROO/EFS/INCOME. DVS953.DAT. 08. APR05  (9,986) 

Pai  Fwiction:  |  SANDRA. DVCHECK. PGM 

revised  DV's  of  |  SANDRA.DVCHECK.LOG 

Sandra  Swain  v  SANDRA. DVCHECK. OUT 
written  by  Phil 


../../PROO/EFS/DV. REVISED.DAT. 13. MAY20  (9,986) 

NOVED  TO 
../../OLDPROD/EFS/DV. REVISED. DAT. 13. MAY20  (9,986) 

REIUUCD  TO 
../../PRa>/EFS/INCaHE.DVS953.DAT.08.APR05     (9,986) 

UPLOADED  TO 
SPEC.I(n'9311.EFS.APR14.PUBREAD     (9,986) 
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NCT     DATAFLOW 


UNIX  PROCESSING 
NCT  RAW  FILE 


NCT  BRANCH 

STEP  01: 

SAS  PROGRAM:  CAPS/COO/PROO/EFS/VLDPIO.PGH 

SAS  LOG:  CAPS/COD/PROD/EFS/VLDPID.LOG 

SAS  OUTPUT:  CAPS/CCD/PROD/EFS/VLDPID.OUT 

SAS  CARD(S): 

INPUT  FILE(S):  CAPS/C00/PR00/EFS/RAW1728. COMPLETE. DAT. 00. JAN26     (47,057) 

OUTPUT  FILE(S):  CAPS/C00/PR00/EFS/RAW1728. VLDPID.DAT. 00. FEB10      (40,917) 


STEP  02: 

SAS  PROGRAM:  CAPS/COD/PROO/EFS/RAUNTEHP.PGH 

SAS  LOG:  CAPS/COO/PROD/EFS/RAUNTEHP.LOG 

SAS  OUTPUT:  CAPS/COD/PROD/EFS/RAWNTEMP.OUT 
SAS  CARD(S): 

INPUT  FILE(S):  CAPS/C00/PR00/EFS/RAU1728. VLDPID.DAT. 00. FEB10      (40,917) 

+ 

:  CAPS/C00/PR0D/EFS/RAW1728. COMPLETE. DAT. 00. JAN26     (246  ) 

OUTPUT  FILE(S):  CAPS/C00/PR00/EFS/RAWN1728. NONTEMP.DAT. 02. MAR17     (40,   ) 

I 
I 
I 

UPLOADED  TO  MAINFRAME 
I 
I 
I 
I 

V 


'SPEC.  ICT9311  .RAU1728.IICNTEMP.NAR17' 


(Prograi  is  reaoving  all  difilicate  records  and  all  nan_teaporary  residence) 


NCT  BRANCH 


MAINFRAME  PROCESSING: 


FIRST  PROGWW  :     SPEC.ICT9311.SRCECDUPSEQ2) 

INPUT  FILE(S):     SPEC.ICT9311.RAWNCT.BiajP0126.PUBREAD  #  RECORDS:     47,057 
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OUTPUT  FILE(S):  SPEC.ICT9311 .RAWNCT.DUPS 

SPEC. ICT9311.RAWNCT. UNIQUE 

(  LRECL:  1728,  BLKSIZE:  8640  ) 

(Used  SORT  &  PL1  to  create  two  files:  DUPLICATE  RECORD  FILE  &  UNIQUE  RECORD  FILE) 
(Program  is  checking  duplicate  records) 


#  RECORDS:  86 

#  RECORDS:  46,971 


SECOND  PROGRAH;  SPEC.1CT9311.SRCECFSPLIT3) 

INPUT  FILE(S):  SPEC.ICT9311. RAWNCT.DUPS 


OUTPUT  FILE(S): 


SPEC. ICT9311. RAWNCT.DUPS. DROP 
SPEC . I CT9311 . RAWNCT . DUPS . KEEP 
(  LRECL:  1728,  BLKSIZE:  8640  ) 


#  RECORDS:  86 

#  RECORDS:  30 

#  RECORDS:  56 
(28  DUPLICATES) 


(Program  is  spliting  households  that  could  not  be  manually  resolved  vs.  those  that  can  be  resolved.  30  vs  50 
respectively,  set  of  28  duplicates  will  be  merged  to  the  UNIQUE  File). 


THIRD  PROGRAII:  SPEC.ICTWII.SRCECRECCPE) 

INPUT  FILE(S):  SPEC.ICT9311.RAWNCT.UNIQUE 

OUTPUT  FILE(S):  SPEC.ICT9311.RAWNCT.REC0DE.HHLD6.FEB3 
LRECL:  1728,  BLKSIZE=8640 


#  RECORDS:  46,971 

#  RECORDS:  46,971 


(14  records  were  receded  using  PGM=RECODED 
6  from  Phil's  frequency  dump,  where  PERSONID  =  '00'  but  should  have  been  '01' 
8  from  Lori's  memo  dated  January  21,  1994  "DOCUMENTATION  FOR  NCT  PHASE2);  records  with  PERSONID  12,13,14) 


FOURTH  PROGRAII:  (STEP1I     SI>EC.ICT9311.SRCE(VU)HERGE) 

INPUT  FILE(S):  SPEC. ICT9311. RAWNCT.DUPS. KEEP 
OUTPUT  FILE(S):  SPEC.ICT9311.RAWNCT.DUPUNIQ 


#  RECORDS:  56 

#  RECORDS:  28 


INPUT  FILE(S): 


CSTEP2)  SPEC. 1019311 .SRCECVLDMERGE) 

SPEC . I CT93 1 1 . RAWNCT . DUPUN I Q 
SPEC.ICT9311.RAWNCT.REC0DE.HHLD6.FEB3 


OUTPUT  FILE(S):  SPEC. ICT9311. RAWNCT. MERGE. DUPREC.FEB3 
(  LRECL:  1728,  BLKSIZE:  8640  ) 


#  RECORDS:  28 

#  RECORDS:  46,971 

#  RECORDS:  46,999 


(SAS  Program  is  picking  up  from  56  records(duplicate)  a  unique  record  by  using  IF>LAST>ID  function  of  SAS 
and  then  concatenating  the  .DUPUNIQ.  file  with  the  recoded  .HHLD6.FEB3.  using  DBSCOPY. 


46,999 


46,960 


FIFTH  PROGRAM  (STEP  1>:  SPEC.ICT9311.SRCE(FSPLIT2) 

INPUT  FILE(S):  SPEC. ICT9311. RAWNCT. MERGE. DUPREC.FEB3 


OUTPUT  FILE(S):  SPEC. ICT9311. RAWNCT. NOHHLD. DATA 
:  SPEC. ICT9311. RAWNCT. HHLD. DATA 


#  RECORDS: 


#  RECORDS:  39 

#  RECORDS: 


(  LRECL:  1728,  BLKSIZE:  8640  ) 
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(STEP  2);  SPEC.1CT9311.SRCECFSPIIT21 

INPUT  FILE(S):  SPEC. ICT9311 .RAUNCT.NOHHLD. DATA 

OUTPUT  FILE(S}:  SPEC. I CT9311. RAUNCT.NOHHLD. KEEP 

:  SPEC. I CT9311. RAUNCT.NOHHLD. DROP 

(  LRECL:  1728,  BLKSIZE:  8640  ) 


#  RECORDS:  39 


#  RECORDS: 

#  RECORDS: 


2 
37 


(Program  is  spliting  PERSONID  >  '00'  &  no  data  beyond  pos.  45; 
39  records  were  found  &  dimped;  2  out  of  39  were  selected  to  keep  and  merge  with  good  file). 


46,962 


(STEP  31;  SPEC.ICT9311.SRCE(FSPLIT2) 

INPUT  FILE(S):  SPEC. ICT9311.RAWNCT.HHLD. DATA 

:  SPEC. ICT9311. RAUNCT.NOHHLD. KEEP 


#  RECORDS: 

#  RECORDS:  2 


46,962 


OUTPUT  F I LE(S):  SPEC. ICT9311.RAUNCT. MERGE. DUPREC.FEB4  #  RECORDS: 

(  LRECL:  1728,  BLKSIZE:  8640  ) 
(Program  is  using  DBSCOPY  to  concatenate  2  above  files  together  and  produce  1  output  file) 


46,960 


46,716 


SIXTH  PROGRAM:  SPEC.ICT9311.SKCECST4ST61 

INPUT  FILE(S):  SPEC. ICT9311.RAUNCT. MERGE. DUPREC.FEB4 


OUTPUT  FILE(S):  SPEC.ICT9311.RAUNCT.UITH.ST4ST6.DR0P 
:  SPEC . I CT931 1 . RAUNCT .UI THOUT . ST4ST6 


#  RECORDS: 


#  RECORDS:  263 
U   RECORDS: 


(  LRECL:  1728,  BLKSIZE:  8640  ) 


(Program  is  spliting  the  good  file  into  2  files 
--  one  with  step4  =  3  or  step6  =  5  (246  records) 
-•  one  without  step4  =  3  or  step6  =5; 

(temporary  residents  and  usual  place  elsewhere  in  Canada) 

NOTE:  17  RECORDS  HAD  BOTH  STEP4  =  3  &  STEP6  =  5;  THAT  ACCOUNTS  FOR  THE  DOUBLE  COUNT  IN  263  RECORDS. 


46,716 

40,934 
5,782 


SEVENTH  PROaWI;  SPEC.ICT9311.SRCE(PERD01 

INPUT  FILE(S):  SPEC. ICT9311 .RAWNCT.UITH0UT.ST4ST6 

OUTPUT  FILE(S):  SPEC. ICT9311 .RAUNCT. VLDPERID 
:  SPEC. ICT9311. RAUNCT. PERIDOO 
(  LRECL:  1728,  BLKSIZE:  8640  ) 


#  RECORDS: 

#  RECORDS: 

#  RECORDS: 


(Program  is  spliting  PERSONID  ='00'  into  a  separate  file  &  keeping  the  rest  of  the  records  into  a  VLDPERID 
file  &  dunping  a  handful  of  records  to  see  the  actual  records  with  PERSONID  =  '00'.) 
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40,917 


EIGHTH  ROGRAM:  SPEC.lCTqBII.SRCECPOSTALBH 

INPUT  FILE(S):  SPEC.ICT9311.RAUNCT.VLDPERID 

OUTPUT  FILE(S):  SPEC. ICT9311.RAUNCT.VLDPERID. BLANK 
:  SPEC.ICT9311.RAUNCT.VLDPERID.N0NBLK 

(  LRECL:  1728,  BLKSIZE:  8640  ) 


#  RECORDS:40,934 

#  RECORDS:   17 

#  RECORDS: 


(Program  is  spliting  input  file  into  2  output  files) 

--  one  which  has  no  information  beyond  Postal  Code  eg.  nothing  in  Pos.51  to  next  154  bytes,  these  are  being 

deleted  from  good  file. 
--  one  Mith  information. 


40,917 
40,915 


MIHTH  ROGRAM;  SPEC.ICT9311.SRCECDUPSEQ4) 

INPUT  FILECS):  SPEC.ICT9311.RAUNCT.VLDPERID.N0NBLK 

OUTPUT  FILE(S):  SPEC.ICT9311.RAUNCT.VLDPERID.UNIQ 
(  LRECL:  1728,  BLKSIZE:  8640  ) 


#  RECORDS: 


#  RECORDS: 


(Program  is  removing  2  duplicate  records  from  the  file) 

NOTE:  This  step  was  done  after  the  file  had  gone  through  the  PREDIT.  These  2  duplicatae  records  were  picked 

up  a  week  later  when  QSKITXT  was  sent  for  Auto-Coding. 


PREDIT  PROGRAM;  FIMAL  FILE  BEFORE  GOING  INTO  PREDIT 


TENTH  PROGRAM;  SPEC.ICT9311.SRCE(C0HPEDIT) 


247.197 


INPUT  FILE(S):  SPEC.ICT9311.RAWNCT.VLDPERID.N0NBLK 
:  SPEC.ICT9311.SRCE(PREEDIT6) 

OUTPUT  FILE(S):     SPEC.I(rr9311.HCT.PREDIT.N0NTEXT.FEB7 
Irect:     532,     blksize:     8512 

:     SPEC.ICT9311.NCT.PREDIT.TEXT.FEB7 

Irecl:     92,       blksize:     8832 


#  RECORDS:  40,917 
«  RECORDS;  40,917 
«  RECORDS: 


(PL1     program  is  going  through  the  PREDIT  step  and  creating  2  output  files; 
(1)..  TEXT  FILE  ,  Lrecl  =  92,  Blksize  =  8832 
(2)..  NON-TEXT  FILE,  Lrecl  =  532,  Blksize  =  8512) 


FILES  BEIWG  PREPARED  FOB  AUTOCOmWG: 


247,197 


PROGRAM  1;     SPEC.ICT9311.SRCE(S0RT4) 

INPUT  FILE(S):  SPEC. I CT9311.NCT. PREDIT. TEXT. FEB7 


#  RECORDS: 


7,022 

57,560 

12,225 

58,968 

6,041 

5,959 

2,105 


32,. 085 
20.890 
21,228 
20,262 


OUTPUT  FILE(S) 


LRECL  =  92,  BLKSIZE  =  8832 

SPEC. ICT9311.NCT.PREDIT. TEXT. S0RTED1  (Q09T1,  Q09T2,  Q09T3) 

SPEC.ICT9311.NCT.PRE0IT.TEXT.SORTED2  (Q16T1,  016T2,  Q16T3) 

SPEC.ICT9311.NCT.PREDIT.TEXT.SbRTED3  (Q22T1,  Q22T2,  Q22T3) 

SPEC.ICT9311.NCT.PREDIT.TEXT.SORTE04  (Q42T1-Q42T3,  042PC) 

SPEC. ICT9311.NCT.PREDIT. TEXT. S0RTED5  (Q11TX) 

SPEC . I CT931 1 . NCT . PRED I T . TEXT . S0RTED6  (Q1 2TX) 

SPEC. ICT9311.NCT.PREDIT. TEXT. S0RTED7  (Q13TX) 

SPEC. ICT9311. NCT. PREDIT. TEXT. S0RTED8  (Q18TX) 
SPEC.ICT9311.NCT.PREDIT.TEXT.S0RTED9  (Q19TX) 
SPEC. ICT9311. NCT. PREDIT. TEXT. SORTED10  (Q24TX) 
SPEC. I CT93 11. NCT. PRED IT. TEXT. SORTED 11  (Q41TX) 
SPEC. I CT93 11. NCT. PRED IT. TEXT. SORTED 12  (002TX) 

SPEC. ICT9311. NCT. PREDIT. TEXT. S0RTED13  (Q34TX) 

SPEC. ICT9311. NCT. PREDIT. TEXT. S0RTED14  (Q35TX) 

SPEC. ICT9311. NCT. PREDIT. TEXT. S0RTED15  (Q37TX) 

SPEC. ICT9311. NCT. PREDIT. TEXT. S0RTED16  (Q38TX) 

SPEC. ICT9311. NCT. PREDIT. TEXT. S0RTED17  (Q43TX) 
SPEC. ICT9311. NCT. PREDIT. TEXT. S0RTED18  (Q46TA-Q46TK) 


(Using  SORT  to  create  18  output  files  according  to  the  required  fields  from  the  input  file) 


SUBSEOUEWT  PROGRAMCS)  TO  CREATE  TEXT  FILES  FOR  AUTOCOOIIIG 
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#  RECORDS: 

#  RECORDS: 
it   RECORDS: 

#  RECORDS: 

#  RECORDS: 

#  RECORDS: 

#  RECORDS: 

#  RECORDS:  670 

#  RECORDS:  588 

#  RECORDS:  138 

#  RECORDS:  447 

#  RECORDS:  613 

#  RECORDS: 

#  RECORDS: 

#  RECORDS: 

#  RECORDS: 

#  RECORDS:  396 

#  RECORDS: 


SPLIT  FOR  Q09  TEXT  COIOULEDGE  OF  UWGUAGE) 


INPUT: 


SPEC. ICT9311. NCT. PREDIT. TEXT. SORTEDI 
SPEC. ICT9311.NCTAC.LANQ09. FINAL. ESS 


OUTPUT(I):  SPEC.ICT9311.PF.NCT.Q09TXT.WITHAC 
Irecl  =  99,  blksize  =  8811 

(2):  SPEC. ICT9311.PF. NCT. Q09TXT.FORAC 
Lrecl  =  92,  Blksize  =  8832 


#  RECORDS:  7,022 

#  RECORDS:  1,028 

#  RECORDS:  875 


#  RECORDS:  6,147 


I 
I 
I 
I 

(1) 


SPEC. ICT9311.PF. NCT. Q09TX1.FORAC 
(#  RECORDS:  5,385) 


I 
(2) 
SPEC. ICT9311.PF. NCT. 009TX2.FORAC 
(#  RECORDS:  639) 


(3) 

SPEC.ICT9311.PF.NCT.Q09TX3. FORAC 
(#  RECORDS:  123) 


SPLIT  FOR  Q11  TEXT  CHOTHER  TOMGUE) 

I NPUT :  SPEC . I CT931 1 . NCT . PRED I T . TEXT . S0RTED5 
SPEC. ICT9311.NCTAC.LANQ11. FINAL. ESS 


#  RECORDS:  6,041 

#  RECORDS:  883 


OUTPUT(I):  SPEC. ICT9311.PF. NCT. Q11TXT.UITHAC 


#  RECORDS:  756 
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Irecl  =  99,  blksize  =  8811 

(2):  SPEC.ICT9311.PF.NCT.Q11TXT.F0RAC 
Irecl  =  92,  blksize  =  8832 

SPLIT  FOR  Q12  TEXT  fPlACE  OF  BIRTH) 

I NPUT :  SPEC . I CT93 1 1 . NCT . FRED IT . TEXT . S0RTED6 
SPEC. ICT9311.NCTAC.PCTQ12. FINAL. ESS 

OUTPUT(I):   SPEC.ICT9311.PF.NCT.Q12TXT.WITHAC 
Irecl  =  99,  blksize  =  8811 

(2):  SPEC.ICT9311.PF.NCT.Q12TXT.F0RAC 
Irecl  =  92,  blksize  =  8832 


#  RECORDS:  5,285 


#  RECORDS:  5,959 

#  RECORDS:  950 

#  RECORDS:  808 


#  RECORDS:  5,151 


SPLIT  FOR  Q13  TEXT  CCITIZEWSHIP) 

I NPUT :  SPEC . I CT931 1 . NCT . PRED I T . TEXT . SORTED? 
SPEC. ICT9311.NCTAC.PCTQ13. FINAL. ESS 

OUTPUT(I):  SPEC. ICT9311.PF. NCT. Q13TXT.WITHAC 
Irecl  =  99,  blksize  =  8811 

(2):  SPEC. ICT9311.PF. NCT. Q13TXT.F0RAC 
Irecl  =  92,  blksize  =  8832 


#  RECORDS:  2,105 

#  RECORDS:  299 

#  RECORDS:  237 


#  RECORDS:  1,868 


SPLIT  FOR  Q16  TEXT  jETHMIC  ORIGIM) 

INPUT 


SPEC.ICT9311.NCT.PREDIT.TEXT.S0RTED2 
SPEC. ICT9311.NCTAC.ET0Q16. FINAL. ESS 


OUTPUT(I):  SPEC. ICT9311.PF. NCT. Q16TXT.WITHAC 
Irecl  =  99,  blksize  =  8811 

(2):  SPEC. ICT9311.PF. NCT. Q16TXT.F0RAC 
Irecl  =  92,  blksize  =  8832 


#  RECORDS:  57,560 

#  RECORDS:  13,434 

#  RECORDS:  11,869 

#  RECORDS:  45,691 


(1) 
SPEC. ICT9311 .PF. NCT. Q16TX1 . FORAC 
(#  RECORDS:  31,338) 


I 
I 
I 

<2) 

SPEC. ICT9311.PF. NCT. Q16TX2. FORAC 
(#  RECORDS:  10,608) 


(3) 

SPEC. ICT9311.PF. NCT. Q16TX3. FORAC 
(#  RECORDS:  3,745) 


SPLIT  FOR  Q18  TEXT  (RACE) 


INPUT: 


SPEC . I CT931 1 . NCT . PRED I T . TEXT . S0RTED8 
SPEC. ICT9311.NCTAC.ET0Q18. FINAL. ESS 


OUTPUT(I):  SPEC. ICT9311.PF. NCT. Q18TXT.WITHAC 
Irecl  =  99,  blksize  =  8811 


#  RECORDS:  670 

#  RECORDS:  94 

#  RECORDS:  75 
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(2):  SPEC.ICT9311.PF.NCT.Q18TXT.F0RAC 
I  reel  =  92,  blksize  =  8832 


#  RECORDS:  595 


SPLIT  FOR  Q19  TEXT  (IMDIAM  BAMD) 

I NPUT :  SPEC . I CT931 1 . NCT . PRED IT . TEXT . S0RTED9 
SPEC. ICT9311.NCTAC.INDQ19. FINAL. ESS 

OUTPUT(I):  SPEC. ICT9311.PF. NCT. Q19TXT.UITHAC 
Irecl  =  99,  blksize  =  8811 

(2):  SPEC. ICT9311.PF. NCT. Q19TXT.F0RAC 
Irecl  =  92,  blksize  =  8832 


#  RECORDS:  588 

#  RECORDS:  46 

#  RECORDS:  37 


#  RECORDS:  551 


SPLIT  FOR  Q22  TEXT  CHOBILITY  IMSIDE/OUTSIDE  CAMADA 


INPUT: 
OUTPUT: 


SPEC.ICT9311.RAUNCT.VLDPEID.N0NBLK 
SPEC. ICT9311. NCT. Q22TXT 


(40,917) 
(40,917) 


SPEC. ICT9311. NCT. Q22TXTIC 
(40,917) 


SPEC. ICT9311. NCT. Q22TXTIC.NONBLK 
(5,763) 


SPEC. ICT9311. NCT. Q22TXTIC. BLANKS 
(35,154) 


SPEC.ICT9311.NCT.Q22TXT0C 
(40,917) 


SPEC. ICT9311. NCT. Q22TXTOC. BLANKS 
(39,707) 
INPUT:  SPEC.ICT9311.Q22TXTIC.N0NBLK 
OUTPUT:  SPEC. ICT9311.022TXTIC. UNIQUE 


SPEC . I CT931 1 . NCT . Q22TXTOC . NONBLK 


(5,763) 
(5,761) 


INPUT:  SPEC. ICT9311.Q22TXTIC. UNIQUE        (5,761) 
SPEC. ICT9311 .NCTAC.M5PQ22T1 . FINAL. ESS  (1 ,540) 


OUTPUT:  SPEC. ICT9311.PF. NCT. Q22TXTIC.UITHAC 
Irecl  =  104,  blksize  =  8840 

SPEC. ICT9311.PF. NCT. Q22TXTIC.F0RAC 
Irecl  =  97,  blksize  =  8827 


(892) 

(4,869) 

I NPUT :  SPEC . I CT931 1 .Q22TXTOC . NONBLK 

OUTPUT:  SPEC. ICT9311.Q22TXT0C. UNIQUE 


(1,210) 


(1,210) 
(1,210) 


INPUT:  SPEC.ICT9311.Q22TXT0C.UNIQUE         (1,210) 
SPEC. ICT9311.NCTAC.PCTQ22T3. FINAL. ESS  (179) 


OUTPUT:  SPEC. ICT9311.PF. NCT. Q22TXTOC.WITHAC    (138) 
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SPLIT  FPU  Q42  TEXT  (PUCE  OF  WORK) 

INPUT:  SPEC. ICT9311.NCT.VLDPERID. UNIQUE 
OUTPUT:  SPEC.ICT9311.NCT.Q42TXT 


Irecl  =  99,  blksize  =  8811 

SPEC . I CT931 1 . PF . NCT . Q22TXT0C . FORAC 
Irecl  =  92,  blksize  =  8832 


(1.072) 


SPLIT  FOR  Q24  TEXT  (UWGUAGE  OF  EDUCATIOW) 

INPUT:  SPEC. ICT9311. NCT. PREDIT. TEXT. SORTED10 
SPEC.ICT9311.NCTAC.LANQ24.FINAL.ESS 

OUTPUT(I):  SPEC. ICT9311.PF. NCT. Q24TXT.WITHAC 
Irecl  =  99,  blksize  =  8811 

(2):  SPEC. I CT9311.PF. NCT. Q24TXT. FORAC 
Irecl  =  92,  blksize  =  8832 


#  RECORDS:  138 

#  RECORDS:  59 

#  RECORDS:  25 


#  RECORDS:  113 


SPLIT  FOR  041  TEXT  {LANGUAGE  OF  UORK) 

I NPUT :  SPEC .  I CT931 1 .  NCT .  PRED I T .  TEXT .  SORTED  1 1 
SPEC .  I CT931 1 .  NCT  AC .  LANQ41 .  F I NAL .  ESS 

OUTPUT(I):  SPEC. ICT9311.PF. NCT. Q41TXT.WITHAC 
Irecl  =  99,  blksize  =  8811 

(2):  SPEC. ICT9311.PF. NCT. Q41TXT. FORAC 
Irecl  =  92,  blksize  =  8832 


#  RECORDS:  447 

#  RECORDS:.  78 

#  RECORDS:  52 


#  RECORDS:  395 


INPUT:  SPEC.ICT9311, 

OUTPUT:  SPEC.ICT9311, 

:  SPEC.ICT9311, 

INPUT:  SPEC.ICT9311, 
:  SPEC.ICT9311, 
OUTPUT:  SPEC.ICT9311, 
Irecl  =  163, 

SPEC.ICT9311, 
Irecl  =  163, 


NCT.Q42TXT 
NCT.Q42TXT.N0NBLK 
NCT. Q42TXT. BLANKS 

NCT.a42TXT.N0NBLK 
PF.RAWPH1 .Q42TXT.NONBLK 
PF.NCT.al2TXT.UITH0AC 
blksize  =  8802 

PF. NCT. Q12TXT. FORAC 
blksize  =  8802 


#  RECORDS:  17,177 

#  RECORDS:   4,058 

#  RECORDS:   1,425 


#  RECORDS:  15,752 


UNIX  PROCESSIWG  FOB  HCT  AFTER  PREDIT 
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"SPEC.ICT9311.NCT.PREDIT.NCNTEXT.FEB7"     (40,917) 

DOWNLOADED  TO  UNIX 
-CAPS/C0O/STEVENS/NCT/NCT.PREDIT.EXCLQ46"   (40.917) 

UNIX  PROCESSING  (in  SAS) 

NCT   FLOW  OF  FILES  AFTER  PREDIT 


I NPUT :   . . / -  - /STEVENS/NCT/NCT . PRED I T . EXCLQ46 
+ 
. ./. ./PRAKNEE/YOBIMPUT.DAT 

Pai  FMiction; 

merging  Year  of  Birth  and 
imputation  flag  for  year  of  birth 
from  the  274  records  file 
created  by  Matthew  Briggs,  SSMD 
(Mho  manually  imputed  missing  y.o.b.) 


OUTPUT: 


. / . . /PRAKNEE/ YOB I MP . DAT 


. . / • • /praknee/Yobi  mput . pgm 
. ./. ./praknee/Yobi mput. log 


INPUT:   ../../PRAKNEE/YOBIHP.DAT 
+ 
. ./. ./PRAKNEE/SEXMSG.DAT 

Pga  Fmcticn: 

merging  Sex  and  imputed 
sex  flag  from  the  file 
created  by  Phil  (who  manually 
imputed  missing  SEX) 

OUTPUT:   ../../PRAKNEE/SEXFLG.DAT 


. / . . /praknee/Sexf I g2 . pgm 
. / . . /praknee/sexf I g2 . I og 


INPUT:   ../../PRAKNEE/SEXFLG.DAT 

Pgi  Ftgiction; 

Recoding  Assignment  #  for 
certain  PSUs  that  belong  to 
special  population  vs  labour 
force  sample. 

OUTPUT:   ../../PRAKNEE/RECDASS.DAT 


. . / . . /praknee/recdass . pgm 
. ./. ./praknee/recdass. log 
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INPUT: 


. / . . /PRAKNEE/RECDASS . DAT 


Pm  FMKtion: 

sas  program  to  exclude 
duplicate  records 


. ./. ./praknee/excldup.pgm 
. ./. ./praknee/excldup. log 


OUTPUT: 


./. ./PRAKNEE/EXCLDUP. DAT 


. ./. ./PRAKNEE/EXCLDUP. DAT 

Pgi  Function; 

4  records  had  sex  missing 
after  Phil's  receded  sex  pgm 
was  completed,  therefore 
sex  for  those  4  receded  in  this 
program. 


./. ./PRAKNEE/RECDSEX.DAT-- 


=  renamed 


I 

V 

. ./. . /PROD/NCT/RECDSX. PRE532.DAT. 00. FEB24 

Proarai  function: 

derived  age  is 

created  in  this  I    0VAGE2.PGM 

program  from  date  |    DVAGE2.LOG 

of  birth  by  I    DVAGE2.0UT 

Christian  Branconnier,  COD. 

Derived  variable  is 

DVAGE2  (pos.670,2) 

V 
. . / . . /PROO/NCT/D VAGE2 . DVS953 . DAT . 0 1 . FEB24 


./../PR00/NCT/DVAGE2. DVS953.DAT. 01. FEB24  (40,915) 


Proqraa  function; 

derived  language 

variable  are  created 

here  from  Q9 

KOL  (672,2)-  knowledge  of  language 

HLN  (674,1)-  home  language 

MTN  (675,2)-  mother  tongue 

LGEDUC(6?^, 1)  -  language  of  education 

LGUORK  (678,1)-  language  of  work 

OL  (679,1)  -  knowledge  of  official 

language 
NOL  (680,1)-  knowledge  of  non-official 

language 
0LN0L(681,1)-knowledge  of  official/ 

non-official  language 


DVLANG.PGM 
DVLANG.LOG 
DVLANG.OUT 


./../PROO/NCT/DVLANG. DVS953.DAT. 02. FEB28  (40,915 
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../../PROO/NCT/DVLANG. DVS953.DAT. 02. FEB28  (40,915) 

+ 
../../PR00/NCT/ST4ST6.DAT  (263) 


NONTEMP.PGM 
N0NTEHP.LOG 
NONTEMP.OUT 


Prograi  fmction: 

created  data  file  with  263  rec.  & 
3  variables-(hhldid,step4,step6) 
This  program  is  therefore 
removing  households  from  the 
DVLANG  file  that  have  temporary 
residences  through  merge  process 
by  eliminating  those  records 
that  were  in  the  st4st5  data  file. 


./. ./PROD/NCT/NONTEMP. 0VS953.DAT. 03. MAR 16  (40,662) 


./. ./PROD/NCT/NONTEMP. DVS953.DAT. 03. MAR16  (40,662) 


Prograw  ftjiction: 

Q46  (income  question)  and 
Multiple  error  flags  for  all 
related  questions  have  been 
created  here  in  this  program 


Q46MEF.PGM.MAR18 
Q46MEF.LOG. MAR 18 
Q46MEF.0UT.MAR18 


../../PRO0/NCT/Q46MEF. DVS953.DAT. 04. MARIS  (40,662) 


../../PR00/NCT/Q46ME F.DVS953.DAT. 04. MAR 18  (40,662) 

+ 
. ./. ./PROO/EFS/DVLANG. DVS953.DAT. 04. MAR18  (9,986) 


Prograw  function: 

lOJ   creating  a  flag  (EFSFLG)  in 
the  NCT  file  for  those 
records  that  are  in  the  EFS 
file  (pos.669,1) 


EFSFLG. PGM. MARIS 
EFSFLG. LOG. MARIS 
EFSFLG. OUT. MARIS 


. ./. ./PROD/NCT/EFSFLG. DVS953.DAT. 05. MARIS  (40,662) 
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./. ./PROD/NCT/EFSFLG. DVS953.DAT. 05. MARIS  (40,662) 


Proqraw  ftnction: 

This  program  is  linking 
codes  from  autocoding 
back  in  the  NCT  file. 


CODLNK1.PGM.MAR30 
COPLNK1.LOG.MAR30 
C0DLNK1.OUT.MAR30 


. ./. ./PROO/NCT/COOLNK. DVS953.DAT. 06. MAR30  (40,662) 
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./. ./PROO/NCT/COOLNK. DVS953.DAT. 06. MAR30  (40,662) 


CLINKEFS.PGM.MAR30 
CLINKEFS.LOG.MAR30 
CLINKEFS.OUT.MAR30 


Program  fiMiction; 

12 i    linking  the  EPS  codes 
that  were  not  sent  for 
autocoding,  for  they  already  j 
had  a  code  (because  the  text| 
in  the  NCT  file  was  the  same  j 
as  in  the  EPS  file)         | 

V 
. ./. ./PROD/NCT/CLINKEPS. DVS953.DAT. 07.MAR30  (40,662) 


r./../PROO/NCT/CLINKEPS. DVS953.DAT. 07. MAR30  (40,662) 


13! 


Prograa  ftwction: 

linking  language  and  ethnic 
recedes  and  multiple  codes 
to  the  above  file. 


C0DLNK2.PGM.MAR30 
CODLNK2.LOG.MAR30 
C0DLNK2.0UT.MAR30 


./. ./PR00/NCT/C00LNK2. DVS953.DAT. 08. MAR30  (40,662) 


../../PR00/NCT/C0DLNK2. DVS953.DAT. 08. MAR30  (40,662) 


14! 


Prograw  function; 

receding  Assignment  &  PSU 
according  to  different 
rotation  group  (Lori's  request) 
(pos.12,5)  &  (pos.8,1) 


ASSPSU.PGM.MAR30 
ASSPSU.LOG.MAR30 
ASSPSU.OUT.MAR30 


. / . . /PROO/NCT/ASSPSU . D VS953 . DAT . 09 . MAR30  ( 40 , 662 ) 


../../PROO/NCT/ASSPSU. DVS953.DAT. 09. MAR30  (40,662) 


15! 


Prograw  function: 

creating  labour  market 
derived  variables  from 
pos  (790  to  804) 


LBPORCE.PGM.MAR30 
LBPORCE.LOG.MAR30 
LBPORCE.OUT.MAR30 


. . / . . /PROO/NCT/LBPORCE . D VS953 . DAT . 1 0 . MAR30  ( 40 , 662 ) 


../../PROD/NCT/LBFORCE. DVS953.DAT. 10. MAR30  (40,662) 


Proqraw  fiMiction; 

16j   weights  are  added  to  the 
Labour  force  file  in  this 
file  (pos. 925, 8). 


LBFORCE.WGTED.PGN.MAR30 
LBFORCE .WGTED . LOG.HAR30 
LBFORCE  .UGTED  .OUT  .MAR30 


33 


./../PR00/NCT/LBF0RCE.DVS953. WEIGHTED. DAT. 11.HAR30  (40,662) 


../../PROO/NCT/LBFORCE. 0VS953.WEIGHTED.DAT. 11. MAR30  (40,662) 
Pan  Function:       | 
17i  I   (file  produced  by  Norm  Crampton  on  mainframe,  to  create  ethnic  OVs) 

V 

../../PROO/NCT/ETHDVS. DVS953.DAT. 12. APR22  (40,662) 


./../PROO/NCT/ETHDVS. DVS953.DAT. 12. APR22  (40,662) 


FIXQ46.PGM.HAY02 
FIXQ46.LOG.NAY02 
FIXQ46.0UT.MAY02 


18! 


Pgi  FtMiction; 

Picking  a46AAmt  from  the  raw 
file  (length  20  bytes)  for 
in  the  Predit  only  19  bytes  of 
Q46AAmt  were  picked  up  and  the 
20th  byte  was  garbage,  in  order  to 
run  Q46MEF.PGM  again  just  for 
Q46AAmt  in  the  next  step. 


./. . /PR0D/NCT/FIXQ46. DVS953.DAT. 13. MAY02  (40,662) 


./. . /PR0D/NCT/FIXQ46. DVS953.DAT. 13. MAY02  (40,662) 
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Pqa  Function: 

Rerunning  Q46MEF.PGM  for 
Q46AAmt  only  and  for 
related  multiple  flags. 


Q46MEF.REV.PGM 
Q46MEF.REV.LOG 
Q46MEF.REV.0UT 


../../PR0D/NCT/Q46MEF. DVS953.DAT. 14. HAY02  (40,662) 


. / . . /PR00/NCT/Q46ME  F . DVS953 . OAT . 1 4 . MAR Y02  (40 , 662 ) 
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Pai  Ftwction; 

Creating  income  derived 

variables:  wages,  investment  income 

UIC,  total  income,  etc. 


INCOME. PGM. MAY02 
INCOME. LOG. MAY02 
INCOME. OUT. MAY02 


./../PROO/NCT/INCOME. DVS953.DAT. 15. MAY02  (40,662) 


./../PROD/NCT/INCOME. DVS953.DAT. 15. MAY02  (40,662) 


Pai  Function: 

Fixing  language  DVs  and  adding 
21 1   the  AGEGP  (pos. 666,2)  and  SEX 
(pos.668,1)  variables, 
(fixed  by  Phil) 


FIX.DVLANG.PGM 
FIX.DVLANG.LOG 
FIX.DVLANG.OUT 
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22! 


23 1 


V 
../../PROD/NCT/REVDVLANG.DVS953.DAT.15A.MAY04  (40,662) 


../../PROD/NCT/REVDVLANG.DVS953.DAT.15A.MAY04  (40,662) 

Pm»  Function; 

Cleaning  income  derived  variables 
like  wages,  self  einplyment,  UIC, 
with  '  '  instead  of  '.'  and 
Spiliting  the  file  into  2  parts 
the  Labour  force  file  and  the 
Special  Population  file. 

V 
../.■/PRCD/IICT/DVS953. LFSiWP.DAT. 16.IUY04  (32,696) 


INCOME. FIX1. PGM 
INC0ME.FIX1.LOG 
INCOME. FIX1 .OUT 


. ./. ./PR0D/NCT/PART1 .MAY04  (10,000) 

. ./. ./PROO/NCT/PART2.MAY04  (10,000) 

. . / . . /PR00/NCT/PART3 . MAY04  (10,000) 

../../PR00/NCT/PART4.HAY04  (  2,696) 


../../PRaO/NCT/SPECPaP.DVS953.DAT.17.NAY04  (7.966) 

../../PRO0/NCT/PART5.MAY04  (  7,966) 


LABOUR  FORCE  FILE 

../../PRaO/NCT/DVS953. LFSAMP.DAT. 16. MAY04  (32,696) 

RENAMED  TO 

../../PRaO/NCT/DVS953. LFSAMP.DAT. 14. APR08  (32,696) 
(in  order  to  stick  with  the  file  name  SM  people  had  already  been  told  to  use) 

UPLOADED  TO  MAINFRAME  AS 

INPUT:    "SPEC.ICT9311.NCT.LFSAMP.APR14.PUBREAD" 

(  receding  the  HLDSIZE  from  blank  to  '0') 
I 
I 

V 
OUTPUT:   "SPEC. ICT9311.NCT.LFSAMP.HHSIZE. FIX" 

INPUT:    "SPEC. ICT9311.NCT.LFSAMP.HHSIZE. FIX" 

(  receding  Q46,  WAGES,  SELF_EMP,  &  INVEST_INC  ) 
I 

I 

V 

OUTPUT :   "SPEC . I CT93 1 1 . NCT . L  FSAMP . RECOOE . WAGES" 

I 

RENAMED  ON  MAINFRAME  TO 
■SI»EC.ICT9311.MCT!t.FSAHP.APR14.PtlBREAD" 

DOWNLOADED  TO  UNIX  AS 
../../PR00/NCT/DVS953. LFSAMP.DAT. 16. MAY16  (32,696) 
RENAMED  TO 

.  ./■■/l>R0D/IICT/p\^953.LFSAHP.pAT.14.*PR08  (32.696) 

(in  order  to  stick  with  the  file  name  SM  people  had  already  been  told  to  use) 
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NOTE:  PSU  RECOOE  WAS  DONE  ON  LFSAHP  FILE  BUT  NOTHING  WAS 
RECODED,  FOR  THERE  UAS  NO  PSU#  ='85087'  IN  THE  FILE 
TO  BE  RECODED.  HOWEVER.  THE  STEP  UAS  DONE  IN  ORDER  TO 
CHECK  ITS  EXISTENCE  AND  THERFORE  DOCUMENTED  IN 
REVPSU. PGM/RE VPSU. LOG/RE VPSU. OUT  BUT  UAS  NOT  CARRIED 
OUT  (PROCESSED). 
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../../PROD/NCT/DVS953.LFSAMP.DAT.14.APR08  (32,696) 
Pai  Fmction: 

LABFRCE.DVCHECK.PGM 
LABFRCE.L0G.NAY19 
V  LABFRCE.0UT.MAY19 

,/../PROD/NCT/LABFRCE. DV.REVISED.DAT. 17. MAY19  (32,696)- 
I 

MOVED  TO 


./. ./OLDPROD/NCT/LABFRCE.DV. REVISED. DAT. 17.MAY19.Z  (32,696) 
I 

RENAMED  TO 


.  ./■■/PR0D/IICT/DVS953.LFSW>.DAT.14.APR0a  C32.696)     < 

(still  sticking  with  the  tile  name  SM  people  had  already  been  told  to  use) 


SPECIAL  POPULATION  FILE 
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../../PROD/NCT/SPECPOP.DVS953.0AT.17.MAY04  (7,966) 

RENAMED  TO 

../../PROO/NCT/SPECPOP. DVS953.DAT. 15. APR 14  (7,966) 
(in  order  to  stick  with  the  file  name  SM  people  had  already  been  told  to  use) 

UPLOADED  TO  MAINFRAME  AS 

INPUT:    "SPEC.ICT9311.NCT.SPECP0P.APR14.PUBREAD" 
(  recoding  the  HLDSIZE  from  blank  to  '0') 

I 
I 
I 

V 

"SPEC. ICT9311.NCT.SPECP0P.HHSIZE. FIX" 


OUTPUT: 


INPUT:    "SPEC. ICT9311.NCT.SPECP0P.HHSIZE. FIX" 

(  recoding  046,  UAGES,  SELF_EMP,  &  INVEST_INC  ) 
I 
I 
I 
I 

V 
OUTPUT:   "SPEC. ICT9311.NCT.SPECP0P. RECOOE. UAGES" 
I 

RENAMED  ON  MAINFRAME  TO 
I 

■SPEC.ICT9311.tlCT.SPECP0P.APR14.PUBKEAD' 

after  deleting  the  one  created  before  under  the  same  name 
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(in  order  to  stick  with  the  file  name  SM  people  had  already  been  told  to  use) 

DOWNLOADED  TO  UNIX  AS 
../../PR00/NCT/SPECP0P.DVS953.DAT.17.MAY16  (7,966) 
RENAMED  TO 

. ■/■ ./PR0D/HCT/SPECP0P.PVS953.PAT.15.APR14  C7.966) 

(still  sticking  with  the  tile  name  SM  people  had  already  been  told  to  use) 
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../../PROO/NCT/SPECPOP. DVS953.DAT. 15. APR14  (7,966) 
Proqraw  ftiiction; 

REVPSUSP.PGM 
REVPSUSP.LOG 
V       REVPSUSP.OUT 
./../PR0D/NCT/REVPSU.SPECP0P.DAT.17.MAY18  (7,966)- 

I 
I 

MOVED  TO 


./../0LDPR00/NCT/REVPSU.SPECP0P.DAT.17.MAY18  (7,966) 


RENAMED  TO 


./../PR(P/IICT/SPECP0P.DVS953.DAT.15.APR14  (7.9661  <- 


../../PROD/NCT/SPECPOP. DVS953.DAT. 15. APR 14  (7,966) 


SPECPOP.DVCHECK.PGM 

SPECPOP.LOG.MAY20 

SPECPOP.OUT.MAY20 
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Prograw  fmction:  } 

Corrections  to  | 

LF  DVs  for  Sandra       j 

Swain.  Pgm  by  Phil      | 

V 

../.. /PROD/NCT/SPECPOP. DV.REVISED.DAT. 18. MAY20  (7,966) 

I 

MOVED  TO 
I 

../../OLDPROO/NCT/SPECPOP. DV.REVISED.DAT. 18. MAY20  (7,966) 
RENAMED  TO 


. ./../PK0D/N(n^/SPECP0P.DVS953.DAT.15.APR14  (7.966)  <- 


<^ 
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